Computer  Science 


■■  U..,-- 


1^5 


^SFECTEDS 


Bibliography  of  Research  in 
Natural  Language  Generation 


rtk' „ 

L*  •  i  y  ■’ 


Marie  Kantrowitz 
November  1993 
CMU-CS-93-216 


i  , 

i 

f: 


ELECTE 
DEC2  81993 


•  ddlu^tli:  bes  s»ea  opprovad 
f^l^^easa  and  sals,  its 
nbutios  IS  anlinif  d 


\  ^ 


\ 

.v»»> 


Mellon 


HL 


93-31348 

Mftm  iftiiB  MtM  " 


98  l2  2?  0  8  0 


Best 

Available 

Copy 


Dnc  QUALITY  INSPECTED  5 


Bibliography  of  Research  in 
Natural  Language  Generation 


Mark  Kantrowitz 
November  1993 
CMU-CS-93-216 


TIC 


r-:i..eCTE 

81993 


School  of  Computer  Science 
Carnegie  Mellon  University 


Abstract 

This  document  provides  a  12004'  entry  bibliography  of  natural  language  generation  research.  The  BibTeX 
sources  for  the  Ubliography  are  available  by  anonymous  ftp. 
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1.  Introduction 


This  provides  a  comprehensive  bibliography  of  Natural  Language  Generation  research.  The 

bibliography  includes  over  1^00  BibTeX-fonnated  entries  and  is  available  free  by  anonymous  ftp. 

When  compiling  this  Ubiiography,  we  began  by  iiKluding  relevant  references  from  several  PhD  the¬ 
ses,  bodts,  crafereroes  and  workslu^  including  the  natural  language  generation  workshops  (TWNLGS, 
ENLGWS),  natural  language  processing  conferences  (ANLP,  TINLAP,  SPEECH),  artificial  intelligence 
conferences  (AAAI,  SCAI,  ECAI,  GWAI,  UCAI,  CAIA),  and  computational  linguistics  and  cognitive  sci¬ 
ence  conferences  (ACL,  COLING,  EACL,  COGSCI).  Tracing  backwards  from  these  references,  we  filled 
in  some  of  the  holes  in  the  bibliography.  Where  possible,  we*ve  consulted  the  original  source  to  verify  the 
accuracy  of  the  citations.* 

In  any  undertaking  of  this  magnitude,  errors  and  omissions  are  hard  to  avoid.  We  apologize  in  advance 
to  any  author  who  has  been  overlooked,  and  welcome  corrections  and  additions.  We  intend  to  continue 
updating  the  BibTeX  database.  Please  send  updates  by  E-mail  to  mkant  0  cs .  emu .  edu. 

We  originally  intended  to  produce  an  armotated  and  cross-indexed  guide  to  the  NLG  literature,  but  ran 
out  of  time.  The  field  is  growing  so  fast  that  if  we  were  to  wait  until  the  guide  was  finished,  it  would  already 
be  obsolete. 


2.  Obtaining  the  BibTeX  Bibliography  by  FTP 

The  bibliogn^hy  is  maintained  in  the  form  of  a  BibTeX  file.  To  obtain  the  bibliography,  connect  by 
anonymous  ftp  to  ftp.cs.cmu.edu  [128.2.206.173]  using  username  “anonymous”  and  password 
“ncuneShost”  (your  email  address).  The  file  nlg-bib.tar  .gz,  a  gzipped^  tar  file,  is  located  in  the 
directory 

user/ai/software/nlp/nlg/bib/mk/. 

and  includes  the  BibTeX  file  nig .  bib  and  this  report  nlg-bib .  t  ex.  If  your  site  runs  the  Andrew  File 
Systran,  you  can  find  the  file  in  the  AFS  directory 

/afs/cs .cmu.edu/pro ject/ai-repository/ai/software/nlp/nlg/bib/mk/. 


3.  Overviews  of  NLG  Research 


There  have  been  several  good  surveys  of  work  in  natural  language  graieration.  Mann  et  al  [698]  reviews 
work  prior  to  1982.  Nfore  recent  (and  also  more  technical)  overviews  may  be  found  in  McDonald’s  article  in 


'We  foond  a  snipiising  number  of  incoirect  citations  in  the  reference  sections  of  these  papers.  Incorrect  ciutions  that  appeared 
in  one  paper  tended  to  propagate  to  later  papers. 

^GZn*  is  a  patent-&ee  compression  program  that  achieves  a  betto’  compression  ratio  than  COMPRESS.  It  is  available  free  by 
anonymous  fkp  from  prep .  al .  mlt .  edu  [18.71.038]  in  the  pub/gnu/  directoiy. 
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the  Encyclopedia  of  Aitificial  Intelligence  [774. 782].  McKeown  and  Swaitout’s  article  in  the  1987  Annual 
Review  of  Computer  Science  [808]  (also  reprinted  in  Zock  and  Sabah’s  book  [809]).  and  Kempen’s  survey 
[594].  Robin  [991]  and  Gumming  [217]  summarize  work  on  lexical  selection.  Marui  [696]  and  Joshi  [553] 
discuss  the  differences  between  the  synthesis  and  analysis  of  natural  language.  Hovy  gives  an  overview  of 
trends  in  research  on  the  discourse  structure  of  monologic  text,  focusing  on  the  structuralist  and  functionalist 
approaches  [486]. 

For  a  survey  of  work  on  expert  system  explanation,  which  is  a  distinct  and  fairly  large  subfield  of  natural 
language  generation,  consult  Nfoore  and  Swartout  [858].  The  characteristics  of  a  good  explanation  facility 
are  discussed  in  Nfoore  and  Paris  [857].  Suthers  surveys  explanation  planning  mechanisms  [1104]. 

For  a  survey  of  work  on  machine  translation,  see  Slocum’s  article  in  Computational  Unguistics  [1080]. 

Summaries  of  recent  work  and  current  issues  in  natural  language  generation  also  appear  in  Hovy  and 
McCoy  [492].  Mykoudecka  [868].  and  Fedder  [323].  as  well  as  in  the  proceedings  of  the  natural  language 
generation  workshops. 

Since  1983  there  have  been  regular  international  workshops  on  natural  language  generation,  and  European 
workshops  since  1987.  The  proceedings  from  some  of  these  workshops  have  been  published  in  book  form: 


International  NLG  Workshops 

a 

Year 

Location 

References 

n 

■Eia 

Stuttgart,  Germany 

Q 

1984 

Stanford,  California 

a 

1986 

Nijmegen,  The  Netherlands 

[593] 

4" 

1988 

Los  Angeles,  California 

[917] 

5 

1990 

Pittsburgh,  Perutsylvania 

[923, 156] 

m 

1992 

Trento,  Italy 

[229] 

European  NLG  Workshops 

D 

Year 

Location 

References 

1987 

Abbey  de  Royaumont,  France 

[1215, 1216] 

Q 

1989 

Edinburgh 

[230] 

a 

1991 

Iimsbruck,  Austria 

[468] 

Perrault  and  Grosz’s  article  in  the  1986  Annual  Review  of  Computer  Science  [937]  surveys  work  on 
natural  language  interfaces.  See  also  Rich  [985].  NLI  work  is  relevant  to  NLG.  but  most  work  in  NLI 
focuses  on  the  understanding  end  of  the  interface.  Simmons  [1072]  is  a  much  older  survey  of  work  on 
question-answering. 
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